Overview

Dataset Statistics

Number of Variables 36
Number of Rows 56046
Missing Cells 27638
Missing Cells (%) 1.4%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 97.9 MB
Average Row Size in Memory 1.8 KB
Variable Types
  • Categorical: 28
  • Numerical: 6
  • GeoPoint: 1
  • GeoGraphy: 1
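
The missing-cell percentage above can be reproduced from the other overview figures. The sketch below assumes the report divides missing cells by the total number of cells (variables × rows); the report does not state the formula, so treat that as an assumption.

```python
# Figures copied from the Dataset Statistics above.
n_variables = 36
n_rows = 56046
missing_cells = 27638

# Assumption: the percentage is taken over every cell in the table.
total_cells = n_variables * n_rows
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```

Under that assumption the result rounds to the 1.4% shown in the overview.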

Dataset Insights

ProductColor has 26878 (47.96%) missing values [Missing]
ProductKey is skewed [Skewed]
TerritoryKey is skewed [Skewed]
ProductSubcategoryKey is skewed [Skewed]
ProductCost is skewed [Skewed]
ProductPrice is skewed [Skewed]
OrderDate has a high cardinality: 911 distinct values [High Cardinality]
StockDate has a high cardinality: 1001 distinct values [High Cardinality]
OrderNumber has a high cardinality: 25164 distinct values [High Cardinality]
FirstName has a high cardinality: 660 distinct values [High Cardinality]
LastName has a high cardinality: 365 distinct values [High Cardinality]
BirthDate has a high cardinality: 8072 distinct values [High Cardinality]
EmailAddress has a high cardinality: 17416 distinct values [High Cardinality]
ProductSKU has a high cardinality: 130 distinct values [High Cardinality]
ProductName has a high cardinality: 130 distinct values [High Cardinality]
OrderNumber has constant length 7 [Constant Length]
OrderLineItem has constant length 1 [Constant Length]
OrderQuantity has constant length 1 [Constant Length]
MaritalStatus has constant length 1 [Constant Length]
Gender has constant length 1 [Constant Length]
TotalChildren has constant length 1 [Constant Length]
HomeOwner has constant length 1 [Constant Length]
ProductStyle has constant length 1 [Constant Length]
ProductCategoryKey has constant length 1 [Constant Length]

Variables


OrderDate

categorical

Approximate Distinct Count 911
Approximate Unique (%) 1.6%
Missing 0
Missing (%) 0.0%
Memory Size 4144640

Length

Mean 8.9507
Standard Deviation 0.6294
Median 9
Minimum 8
Maximum 10

Sample

1st row 1/1/2015
2nd row 2/6/2015
3rd row 7/27/2015
4th row 8/27/2015
5th row 9/9/2015

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 389558
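
OrderDate is profiled as categorical because its values are date strings of 8 to 10 characters. A minimal sketch of converting them to real dates follows. It assumes month-first M/D/YYYY order, which the sample value 7/27/2015 suggests; the helper name `parse_us_date` is hypothetical.

```python
from datetime import date, datetime


def parse_us_date(text: str) -> date:
    """Parse an M/D/YYYY string; zero padding of month and day is optional."""
    # Assumption: month comes first, as in the sample row 7/27/2015.
    return datetime.strptime(text.strip(), "%m/%d/%Y").date()


# The five sample rows shown for OrderDate.
samples = ["1/1/2015", "2/6/2015", "7/27/2015", "8/27/2015", "9/9/2015"]
parsed = [parse_us_date(s) for s in samples]

for raw, value in zip(samples, parsed):
    print(raw, "->", value.isoformat())
```

The same helper would apply to StockDate and BirthDate if they share the format, which their samples suggest but the report does not confirm.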

StockDate

categorical

Approximate Distinct Count 1001
Approximate Unique (%) 1.8%
Missing 0
Missing (%) 0.0%
Memory Size 4145878

Length

Mean 8.9728
Standard Deviation 0.6389
Median 9
Minimum 8
Maximum 10

Sample

1st row 9/21/2001
2nd row 10/12/2001
3rd row 6/16/2002
4th row 6/21/2002
5th row 5/20/2002

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 390796

OrderNumber

categorical

Approximate Distinct Count 25164
Approximate Unique (%) 44.9%
Missing 0
Missing (%) 0.0%
Memory Size 4035312

Length

Mean 7
Standard Deviation 0
Median 7
Minimum 7
Maximum 7

Sample

1st row SO45080
2nd row SO45383
3rd row SO46896
4th row SO47311
5th row SO47507

Letter

Count 112092
Lowercase Letter 0
Space Separator 0
Uppercase Letter 112092
Dash Punctuation 0
Decimal Number 280230
  • OrderNumber has words of constant length
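
The counts above average two uppercase letters and five digits per row (112092 and 280230 over 56046 rows), which matches the sample pattern of "SO" followed by five digits. The sketch below validates that pattern. The fixed "SO" prefix is an assumption drawn from the five samples only, and `is_valid_order_number` is a hypothetical helper.

```python
import re

# Assumption: every order number is "SO" followed by exactly five digits.
ORDER_NUMBER = re.compile(r"SO\d{5}")


def is_valid_order_number(value: str) -> bool:
    """Return True when the whole string matches the assumed pattern."""
    return ORDER_NUMBER.fullmatch(value) is not None


for candidate in ["SO45080", "SO4508", "so45080"]:
    print(candidate, is_valid_order_number(candidate))
```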

ProductKey

numerical

Approximate Distinct Count 130
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 896736
Mean 438.9621
Minimum 214
Maximum 606
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • ProductKey is skewed left (γ1 = -0.8325)

Quantile Statistics

Minimum 214
5-th Percentile 215
Q1 360
Median 479
Q3 529
95-th Percentile 582
Maximum 606
Range 392
IQR 169

Descriptive Statistics

Mean 438.9621
Standard Deviation 118.6124
Variance 14068.8901
Sum 2.4602e+07
Skewness -0.8325
Kurtosis -0.6063
Coefficient of Variation 0.2702
  • ProductKey is not normally distributed (p-value 1.8128022591242458e-12)
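
Several of the ProductKey figures are derived from the others, so they can be cross-checked by hand. The sketch below recomputes the range, the IQR, and the coefficient of variation from the reported values, using the standard definitions (maximum minus minimum, Q3 minus Q1, standard deviation divided by mean).

```python
# Values copied from the ProductKey statistics above.
minimum, q1, q3, maximum = 214, 360, 529, 606
mean, std = 438.9621, 118.6124

value_range = maximum - minimum   # spread between the extremes
iqr = q3 - q1                     # spread of the middle 50% of rows
cv = std / mean                   # standard deviation relative to the mean

print(value_range, iqr, round(cv, 4))
```

All three results agree with the report (392, 169, and 0.2702).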

CustomerKey

numerical

Approximate Distinct Count 17416
Approximate Unique (%) 31.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 896736
Mean 18843.6456
Minimum 11000
Maximum 29483
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • CustomerKey is skewed right (γ1 = 0.2773)

Quantile Statistics

Minimum 11000
5-th Percentile 11425
Q1 14016
Median 18157
Q3 23425.75
95-th Percentile 28032
Maximum 29483
Range 18483
IQR 9409.75

Descriptive Statistics

Mean 18843.6456
Standard Deviation 5412.4498
Variance 2.9295e+07
Sum 1.0561e+09
Skewness 0.2773
Kurtosis -1.166
Coefficient of Variation 0.2872
  • CustomerKey is not normally distributed (p-value 2.676285170470888e-07)

TerritoryKey

numerical

Approximate Distinct Count 10
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 896736
Mean 6.2547
Minimum 1
Maximum 10
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • TerritoryKey is skewed left (γ1 = -0.489)

Quantile Statistics

Minimum 1
5-th Percentile 1
Q1 4
Median 7
Q3 9
95-th Percentile 10
Maximum 10
Range 9
IQR 5

Descriptive Statistics

Mean 6.2547
Standard Deviation 2.958
Variance 8.7498
Sum 350549
Skewness -0.489
Kurtosis -0.9888
Coefficient of Variation 0.4729
  • TerritoryKey is not normally distributed (p-value 2.929993446383601e-10)

OrderLineItem

categorical

Approximate Distinct Count 8
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 3699036

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 56046
  • The top 2 categories (1, 2) take over 50.0%
  • OrderLineItem has words of constant length

OrderQuantity

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 3699036

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 56046
  • The top 2 categories (1, 2) take over 50.0%
  • OrderQuantity has words of constant length

Prefix

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 380
Missing (%) 0.7%
Memory Size 3805076

Length

Mean 3.3555
Standard Deviation 0.4787
Median 3
Minimum 3
Maximum 4

Sample

1st row MR.
2nd row MR.
3rd row MS.
4th row MS.
5th row MR.

Letter

Count 131120
Lowercase Letter 0
Space Separator 0
Uppercase Letter 131120
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (MR., MRS.) take over 50.0%

FirstName

categorical

Approximate Distinct Count 660
Approximate Unique (%) 1.2%
Missing 0
Missing (%) 0.0%
Memory Size 3979942

Length

Mean 5.9337
Standard Deviation 1.4184
Median 6
Minimum 2
Maximum 11

Sample

1st row JOHN
2nd row KEVIN
3rd row KAYLA
4th row JADA
5th row BRANDON

Letter

Count 332401
Lowercase Letter 0
Space Separator 2
Uppercase Letter 332401
Dash Punctuation 2
Decimal Number 0

LastName

categorical

Approximate Distinct Count 365
Approximate Unique (%) 0.7%
Missing 0
Missing (%) 0.0%
Memory Size 3960154

Length

Mean 5.5592
Standard Deviation 1.8089
Median 6
Minimum 2
Maximum 16

Sample

1st row THOMAS
2nd row EDWARDS
3rd row RUSSELL
4th row MURPHY
5th row THOMPSON

Letter

Count 311390
Lowercase Letter 0
Space Separator 6
Uppercase Letter 311390
Dash Punctuation 3
Decimal Number 0

BirthDate

categorical

Approximate Distinct Count 8072
Approximate Unique (%) 14.4%
Missing 0
Missing (%) 0.0%
Memory Size 4142445

Length

Mean 8.9115
Standard Deviation 0.6304
Median 9
Minimum 8
Maximum 10

Sample

1st row 11/11/1958
2nd row 11/11/1953
3rd row 8/15/1974
4th row 2/14/1980
5th row 7/21/1945

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 387363

MaritalStatus

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 3699036

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row M
2nd row M
3rd row M
4th row M
5th row M

Letter

Count 56046
Lowercase Letter 0
Space Separator 0
Uppercase Letter 56046
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (M, S) take over 50.0%
  • MaritalStatus has words of constant length

Gender

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 380
Missing (%) 0.7%
Memory Size 3673956

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row M
2nd row M
3rd row F
4th row F
5th row M

Letter

Count 55666
Lowercase Letter 0
Space Separator 0
Uppercase Letter 55666
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (M, F) take over 50.0%
  • Gender has words of constant length

EmailAddress

categorical

Approximate Distinct Count 17416
Approximate Unique (%) 31.1%
Missing 0
Missing (%) 0.0%
Memory Size 5200482

Length

Mean 27.6557
Standard Deviation 1.4841
Median 28
Minimum 22
Maximum 33

Sample

1st row john48@adventure-w...
2nd row kevin38@adventure-...
3rd row kayla44@adventure-...
4th row jada6@adventure-wo...
5th row brandon36@adventur...

Letter

Count 1285171
Lowercase Letter 1285171
Space Separator 0
Uppercase Letter 0
Dash Punctuation 56048
Decimal Number 96539

AnnualIncome

categorical

Approximate Distinct Count 16
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 4098816

Length

Mean 8.1331
Standard Deviation 0.3397
Median 8
Minimum 8
Maximum 9

Sample

1st row $80,000
2nd row $30,000
3rd row $60,000
4th row $40,000
5th row $60,000

Letter

Count 0
Lowercase Letter 0
Space Separator 56046
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 287688
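
AnnualIncome is stored as currency text, so it needs cleaning before numeric analysis. The space-separator count equals the row count and the minimum length is 8 while "$80,000" has 7 characters, which suggests one whitespace character per value; that reading is an inference, not something the report states. A minimal sketch with a hypothetical `parse_income` helper:

```python
def parse_income(text: str) -> int:
    """Convert a string such as ' $80,000' to the integer 80000."""
    # Strip surrounding whitespace first (assumed to be present in the data),
    # then drop the currency symbol and the thousands separators.
    cleaned = text.strip().lstrip("$").replace(",", "")
    return int(cleaned)


# Sample rows from the report, with the suspected padding added to the first.
samples = [" $80,000", "$30,000", "$60,000", "$40,000", "$60,000"]
incomes = [parse_income(s) for s in samples]
print(incomes)
```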

TotalChildren

categorical

Approximate Distinct Count 6
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 3699036

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 0
4th row 0
5th row 4

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 56046
  • TotalChildren has words of constant length

EducationLevel

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 4361272

Length

Mean 12.8159
Standard Deviation 3.1763
Median 15
Minimum 9
Maximum 19

Sample

1st row Partial College
2nd row Bachelors
3rd row Partial College
4th row High School
5th row Bachelors

Letter

Count 674819
Lowercase Letter 575310
Space Separator 43463
Uppercase Letter 99509
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Bachelors, Partial College) take over 50.0%

Occupation

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 4247786

Length

Mean 10.7911
Standard Deviation 2.6235
Median 12
Minimum 6
Maximum 14

Sample

1st row Skilled Manual
2nd row Skilled Manual
3rd row Professional
4th row Skilled Manual
5th row Management

Letter

Count 591656
Lowercase Letter 522470
Space Separator 13140
Uppercase Letter 69186
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Professional, Skilled Manual) take over 50.0%

HomeOwner

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 3699036
  • The largest value (Y) is over 2.23 times larger than the second largest value (N)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row N
2nd row Y
3rd row Y
4th row Y
5th row Y

Letter

Count 56046
Lowercase Letter 0
Space Separator 0
Uppercase Letter 56046
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Y, N) take over 50.0%
  • HomeOwner has words of constant length

ProductSubcategoryKey

numerical

Approximate Distinct Count 17
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 896736
Mean 23.413
Minimum 1
Maximum 37
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • ProductSubcategoryKey is skewed left (γ1 = -0.7004)

Quantile Statistics

Minimum 1
5-th Percentile 1
Q1 19
Median 28
Q3 37
95-th Percentile 37
Maximum 37
Range 36
IQR 18

Descriptive Statistics

Mean 23.413
Standard Deviation 13.4932
Variance 182.0674
Sum 1.3122e+06
Skewness -0.7004
Kurtosis -1.0483
Coefficient of Variation 0.5763
  • ProductSubcategoryKey is not normally distributed (p-value 1.9471919521977113e-15)

ProductSKU

categorical

Approximate Distinct Count 130
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory Size 4100268

Length

Mean 8.159
Standard Deviation 1.3257
Median 7
Minimum 7
Maximum 10

Sample

1st row BK-R50B-58
2nd row BK-R50B-58
3rd row BK-R50B-58
4th row BK-R50B-58
5th row BK-R50B-58

Letter

Count 180728
Lowercase Letter 0
Space Separator 0
Uppercase Letter 180728
Dash Punctuation 81212
Decimal Number 195338

ProductName

categorical

Approximate Distinct Count 130
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory Size 4728310

Length

Mean 19.3648
Standard Deviation 4.0804
Median 21
Minimum 12
Maximum 31

Sample

1st row Road-650 Black, 58
2nd row Road-650 Black, 58
3rd row Road-650 Black, 58
4th row Road-650 Black, 58
5th row Road-650 Black, 58

Letter

Count 784534
Lowercase Letter 613668
Space Separator 127769
Uppercase Letter 170866
Dash Punctuation 35551
Decimal Number 102481

ModelName

categorical

Approximate Distinct Count 40
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 4406140
  • The largest value (Sport-100) is over 1.51 times larger than the second largest value (Water Bottle)

Length

Mean 13.6165
Standard Deviation 4.7659
Median 12
Minimum 8
Maximum 27

Sample

1st row Road-650
2nd row Road-650
3rd row Road-650
4th row Road-650
5th row Road-650

Letter

Count 609221
Lowercase Letter 484504
Space Separator 60647
Uppercase Letter 124717
Dash Punctuation 30023
Decimal Number 62315

ProductDescription

categorical

Approximate Distinct Count 40
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 7838759
  • The largest value (Universal fit, well-vented, lightweight , snap-on visor.) is over 1.51 times larger than the second largest value (AWC logo water bottle - holds 30 oz; leak-proof.)

Length

Mean 74.863
Standard Deviation 43.1933
Median 56
Minimum 18
Maximum 200

Sample

1st row Value-priced bike ...
2nd row Value-priced bike ...
3rd row Value-priced bike ...
4th row Value-priced bike ...
5th row Value-priced bike ...

Letter

Count 3397052
Lowercase Letter 3291141
Space Separator 572138
Uppercase Letter 105911
Dash Punctuation 64724
Decimal Number 25713

ProductColor

categorical

Approximate Distinct Count 7
Approximate Unique (%) 0.0%
Missing 26878
Missing (%) 48.0%
Memory Size 2037854
  • The largest value (Black) is over 2.0 times larger than the second largest value (Yellow)

Length

Mean 4.8661
Standard Deviation 0.9656
Median 5
Minimum 3
Maximum 6

Sample

1st row Black
2nd row Black
3rd row Black
4th row Black
5th row Black

Letter

Count 141934
Lowercase Letter 112766
Space Separator 0
Uppercase Letter 29168
Dash Punctuation 0
Decimal Number 0

ProductSize

categorical

Approximate Distinct Count 19
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 3714424
  • The largest value (0) is over 17.15 times larger than the second largest value (M)

Length

Mean 1.2746
Standard Deviation 0.4463
Median 1
Minimum 1
Maximum 2

Sample

1st row 58
2nd row 58
3rd row 58
4th row 58
5th row 58

Letter

Count 7212
Lowercase Letter 0
Space Separator 0
Uppercase Letter 7212
Dash Punctuation 0
Decimal Number 64222
  • The top 2 categories (0, M) take over 50.0%

ProductStyle

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 3699036
  • The largest value (0) is over 1.8 times larger than the second largest value (U)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row U
2nd row U
3rd row U
4th row U
5th row U

Letter

Count 22439
Lowercase Letter 0
Space Separator 0
Uppercase Letter 22439
Dash Punctuation 0
Decimal Number 33607
  • The top 2 categories (0, U) take over 50.0%
  • ProductStyle has words of constant length

ProductCost

numerical

Approximate Distinct Count 41
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 896736
Mean 255.7504
Minimum 0.8565
Maximum 2171.2942
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • ProductCost is skewed right (γ1 = 2.009)

Quantile Statistics

Minimum 0.8565
5-th Percentile 0.8565
Q1 2.9733
Median 11.2163
Q3 59.466
95-th Percentile 1320.6838
Maximum 2171.2942
Range 2170.4377
IQR 56.4927

Descriptive Statistics

Mean 255.7504
Standard Deviation 496.1929
Variance 246207.4141
Sum 1.4334e+07
Skewness 2.009
Kurtosis 3.1181
Coefficient of Variation 1.9401
  • ProductCost is not normally distributed (p-value 8.315136548143683e-25)
  • ProductCost has 13929 outliers
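
The report does not say how outliers are counted. A common convention is Tukey's rule, which flags values more than 1.5 × IQR beyond the quartiles. The sketch below computes those fences from the ProductCost quartiles; both the rule and the 1.5 multiplier are assumptions here, not the tool's documented method.

```python
# Quartiles copied from the ProductCost quantile statistics above.
q1, q3 = 2.9733, 59.466
p95 = 1320.6838

iqr = q3 - q1
k = 1.5  # Assumption: the conventional Tukey multiplier.

lower_fence = q1 - k * iqr
upper_fence = q3 + k * iqr

print(f"fences: [{lower_fence:.4f}, {upper_fence:.4f}]")
print("95th percentile above upper fence:", p95 > upper_fence)
```

Under this rule the upper fence is about 144.2, far below the 95th percentile, which is in line with a large outlier count for such a right-skewed column.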

ProductPrice

numerical

Approximate Distinct Count 40
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 896736
Mean 438.9693
Minimum 2.29
Maximum 3578.27
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • ProductPrice is skewed right (γ1 = 1.9652)

Quantile Statistics

Minimum 2.29
5-th Percentile 2.29
Q1 7.95
Median 29.99
Q3 159
95-th Percentile 2181.5625
Maximum 3578.27
Range 3575.98
IQR 151.05

Descriptive Statistics

Mean 438.9693
Standard Deviation 838.65
Variance 703333.9045
Sum 2.4602e+07
Skewness 1.9652
Kurtosis 2.8152
Coefficient of Variation 1.9105
  • ProductPrice is not normally distributed (p-value 8.164864083530448e-25)
  • ProductPrice has 13929 outliers

SubcategoryName

categorical

Approximate Distinct Count 17
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 4303197
  • The largest value (Tires and Tubes) is over 2.12 times larger than the second largest value (Bottles and Cages)

Length

Mean 11.7797
Standard Deviation 4.1045
Median 14
Minimum 4
Maximum 17

Sample

1st row Road Bikes
2nd row Road Bikes
3rd row Road Bikes
4th row Road Bikes
5th row Road Bikes

Letter

Count 598013
Lowercase Letter 503290
Space Separator 62194
Uppercase Letter 94723
Dash Punctuation 0
Decimal Number 0

ProductCategoryKey

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 3699036
  • The largest value (4) is over 2.41 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 56046
  • The top 2 categories (4, 1) take over 50.0%
  • ProductCategoryKey has words of constant length

CategoryName

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 4150392
  • The largest value (Accessories) is over 2.41 times larger than the second largest value (Bikes)

Length

Mean 9.0533
Standard Deviation 2.5542
Median 11
Minimum 5
Maximum 11

Sample

1st row Bikes
2nd row Bikes
3rd row Bikes
4th row Bikes
5th row Bikes

Letter

Count 507402
Lowercase Letter 451356
Space Separator 0
Uppercase Letter 56046
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Accessories, Bikes) take over 50.0%

Region

categorical

Approximate Distinct Count 10
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 4132559

Length

Mean 8.7351
Standard Deviation 2.2626
Median 9
Minimum 6
Maximum 14

Sample

1st row Northwest
2nd row Northwest
3rd row Northwest
4th row Northwest
5th row Northwest

Letter

Count 483146
Lowercase Letter 420677
Space Separator 6423
Uppercase Letter 62469
Dash Punctuation 0
Decimal Number 0

Country

categorical

Approximate Distinct Count 6
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 4211843
  • The largest value (United States) is over 1.6 times larger than the second largest value (Australia)

Length

Mean 10.1498
Standard Deviation 3.0857
Median 9
Minimum 6
Maximum 14

Sample

1st row United States
2nd row United States
3rd row United States
4th row United States
5th row United States

Letter

Count 542619
Lowercase Letter 460339
Space Separator 26234
Uppercase Letter 82280
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (United States, Australia) take over 50.0%

Continent

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 4178477
  • The largest value (North America) is over 1.57 times larger than the second largest value (Europe)

Length

Mean 9.5544
Standard Deviation 3.3044
Median 7
Minimum 6
Maximum 13

Sample

1st row North America
2nd row North America
3rd row North America
4th row North America
5th row North America

Letter

Count 508801
Lowercase Letter 426069
Space Separator 26686
Uppercase Letter 82732
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (North America, Europe) take over 50.0%

Interactions

Correlations

Missing Values
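
The per-variable sections report non-zero missing counts for only three columns, and they can be reconciled with the overview total. The sketch below sums them and recomputes each column's missing percentage as missing rows divided by total rows.

```python
n_rows = 56046

# Non-zero "Missing" counts from the variable sections of this report.
missing_by_column = {"Prefix": 380, "Gender": 380, "ProductColor": 26878}

total_missing = sum(missing_by_column.values())
missing_pct = {
    column: round(100 * count / n_rows, 1)
    for column, count in missing_by_column.items()
}

print(total_missing)
print(missing_pct)
```

The total matches the 27638 missing cells in the overview, and the percentages match the per-variable values (0.7%, 0.7%, 48.0%).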